Performance Evaluation of IntelR

نویسندگان

  • Richard M. Yoo
  • Christopher J. Hughes
  • Konrad Lai
  • Ravi Rajwar
چکیده

Intel has recently introduced Intel © Transactional Synchronization Extensions (Intel © TSX) in the Intel 4th Generation Core Processors. With Intel TSX, a processor can dynamically determine whether threads need to serialize through lock-protected critical sections. In this paper, we evaluate the first hardware implementation of Intel TSX using a set of high-performance computing (HPC) workloads, and demonstrate that applying Intel TSX to these workloads can provide significant performance improvements. On a set of real-world HPC workloads, applying Intel TSX provides an average speedup of 1.41x. When applied to a parallel user-level TCP/IP stack, Intel TSX provides 1.31x average bandwidth improvement on network intensive applications. We also demonstrate the ease with which we were able to apply Intel TSX to the various workloads.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-core Implementations of the Concurrent Collections Programming Model

In this paper we introduce the Concurrent Collections programming model, which builds on past work on TStreams [8]. In this model, programs are written in terms of high-level application-specific operations. These operations are partially ordered according to only their semantic constraints. These partial orderings correspond to data flow and control flow. This approach supports an important se...

متن کامل

Tera-Scale 1D FFT with Low-Communication Algorithm and IntelR

This paper demonstrates the first tera-scale performance of Intel © Xeon Phi TM coprocessors on 1D fft computations. Applying a disciplined performance programming methodology of sound algorithm choice, valid performance model, and well-executed optimizations, we break the tera-flop mark on a mere 64 nodes of Xeon Phi and reach 6.7 tflops with 512 nodes, which is 1.5× than achievable on a same ...

متن کامل

parallel_dp: The Parallel Dynamic Programming Design Pattern as an IntelR

Intel Threading Building Blocks (TBB) is an ideal environment for implementation of the parallel dynamic programming design pattern. The task-based parallelism of TBB readily lends itself to the realization of the participants and participant collaboration of this design pattern. We propose the parallel dp algorithm template, an implementation of the parallel dynamic programming design pattern ...

متن کامل

Factors Involved in Evaluation of Physical Education Teachers’ Performance

The aim of this research has been to help improve the performance evaluation of physical education teachers by identifying factors involved in such a task and exploring their relevance through a survey of such teachers in Tehran. A cluster sample of 384 physical education teachers was selected from among all such teachers in Tehran’s 19 districts divided into five geographic areas. Data was col...

متن کامل

The Role of Students’ Social and Academic Integration in Their Evaluation of Faculties’ Educational Performance Quality in Shiraz University of Medical Sciences

Introduction: The purpose of this study was to explore the relationship between students’ social and academic integration and their evaluation of the faculties’ educational performance quality in Shiraz University of Medical Sciences. Methods: This descriptive-correlational study was performed on all students of Shiraz University of Medical Sciences. The participants (n = 431) were selected thr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013